TGF-beta signaling proteins and the Protein Ontology

Author: Cathy Wu
Cecilia Arighi
Darren Natale
Harold Drabkin
Hongfang Liu
Judith Blake
Barry Smith
Winona Barker
Publication date: 01/01/2009

The Protein Ontology (PRO) is designed as a formal and principled Open Biomedical Ontologies (OBO) Foundry ontology for proteins. The components of PRO extend from a classification of proteins on the basis of evolutionary relationships at the homeomorphic level to the representation of the multiple protein forms of a gene, including those resulting from alternative splicing, cleavage and/or posttranslational modifications. Focusing specifically on the TGF-beta signaling proteins, we describe the building, curation, usage and dissemination of PRO. PRO provides a framework for the formal representation of protein classes and protein forms in the OBO Foundry. It is designed to enable data retrieval and integration and machine reasoning at the molecular level of proteins, thereby facilitating cross-species comparisons, pathway analysis, disease modeling and the generation of new hypotheses

PhilPapers

Toll-like receptor signaling in vertebrates: Testing the integration of protein, complex, and pathway data in the Protein Ontology framework

Author: Arighi Cecilia
D’Eustachio Peter
Masci Anna Maria
Natale Darren
Ruttenberg Alan
Shamovsky Veronica
Smith Barry
Wu Cathy
Publication date: 01/01/2015

The Protein Ontology (PRO) provides terms for and supports annotation of species-specific protein complexes in an ontology framework that relates them both to their components and to species-independent families of complexes. Comprehensive curation of experimentally known forms and annotations thereof is expected to expose discrepancies, differences, and gaps in our knowledge. We have annotated the early events of innate immune signaling mediated by Toll-Like Receptor 3 and 4 complexes in human, mouse, and chicken. The resulting ontology and annotation data set has allowed us to identify species-specific gaps in experimental data and possible functional differences between species, and to employ inferred structural and functional relationships to suggest plausible resolutions of these discrepancies and gaps

PhilPapers

Directory of Open Access Journals

PubMed Central

University of Delaware Library Institutional Repository

A domain ontology for the non-coding RNA field

Author: Blake Judith A.
Dou Dejing
Eilbeck Karen
Huang Jingshan
Jiang Guoqian
Lin Yu
Natale Darren A.
Ruttenberg Alan
Smith Barry
Zimmermann Michael T.
Publication date: 01/01/2015

Identification of non-coding RNAs (ncRNAs) has been significantly enhanced due to the rapid advancement in sequencing technologies. On the other hand, semantic annotation of ncRNA data lags behind their identification, and there is a great need to effectively integrate discovery from relevant communities. To this end, the Non-Coding RNA Ontology (NCRO) is being developed to provide a precisely defined ncRNA controlled vocabulary, which can fill a specific and highly needed niche in unification of ncRNA biology

PhilPapers

The representation of protein complexes in the Protein Ontology

Author: Arighi Cecilia
Blake Judith
Bult Carol
Drabkin Harold
D’Eustachio Peter
Evsikov Alexei
Natale Darren
Roberts Natalia
Ruttenberg Alan
Smith Barry
Wu Cathy
Publication date: 01/01/2011

Representing species-specific proteins and protein complexes in ontologies that are both human and machine-readable facilitates the retrieval, analysis, and interpretation of genome-scale data sets. Although existing protein-centric informatics resources provide the biomedical research community with well-curated compendia of protein sequence and structure, these resources lack formal ontological representations of the relationships among the proteins themselves. The Protein Ontology (PRO) Consortium is filling this informatics resource gap by developing ontological representations and relationships among proteins and their variants and modified forms. Because proteins are often functional only as members of stable protein complexes, the PRO Consortium, in collaboration with existing protein and pathway databases, has launched a new initiative to implement logical and consistent representation of protein complexes. We describe here how the PRO Consortium is meeting the challenge of representing species-specific protein complexes, how protein complex representation in PRO supports annotation of protein complexes and comparative biology, and how PRO is being integrated into existing community bioinformatics resources. The PRO resource is accessible at http://pir.georgetown.edu/pro/

PhilPapers

Computational identification of strain-, species- and genus-specific proteins

Author: Mazumder Raja
Murthy Sudhir
Natale Darren A
Thiagarajan Rathi
Wu Cathy H
Publication venue: BioMed Central
Publication date: 01/01/2005

BACKGROUND: The identification of unique proteins at different taxonomic levels has both scientific and practical value. Strain-, species- and genus-specific proteins can provide insight into the criteria that define an organism and its relationship with close relatives. Such proteins can also serve as taxon-specific diagnostic targets.
DESCRIPTION: A pipeline using a combination of computational and manual analyses of BLAST results was developed to identify strain-, species-, and genus-specific proteins and to catalog the closest sequenced relative for each protein in a proteome. Proteins encoded by a given strain are preliminarily considered to be unique if BLAST, using a comprehensive protein database, fails to retrieve (with an e-value better than 0.001) any protein not encoded by the query strain, species or genus (for strain-, species- and genus-specific proteins respectively), or if BLAST, using the best hit as the query (reverse BLAST), does not retrieve the initial query protein. Results are manually inspected for homology if the initial query is retrieved in the reverse BLAST but is not the best hit. Sequences unlikely to retrieve homologs using the default BLOSUM62 matrix (usually short sequences) are re-tested using the PAM30 matrix, thereby increasing the number of retrieved homologs and increasing the stringency of the search for unique proteins. The above protocol was used to examine several food- and water-borne pathogens. We find that the reverse BLAST step filters out about 22% of proteins with homologs that would otherwise be considered unique at the genus and species levels. Analysis of the annotations of unique proteins reveals that many are remnants of prophage proteins, or may be involved in virulence. The data generated from this study can be accessed and further evaluated from the CUPID (Core and Unique Protein Identification) system web site (updated semi-annually).
CONCLUSION: CUPID provides a set of proteins specific to a genus, species or a strain, and identifies the most closely related organism
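
The preliminary uniqueness rule in the DESCRIPTION above can be sketched in Python. This is a minimal illustration, not the CUPID implementation: the `Hit` record, the `classify` function, and the in-memory hit lists are hypothetical stand-ins for parsed BLAST output, and the reverse-BLAST hit list is assumed to exclude the self-hit of the protein used as the reverse query. Only the 0.001 e-value cutoff, the reverse-BLAST check, and the manual-inspection case come from the abstract; the PAM30 re-test for short sequences is not modeled.

```python
from dataclasses import dataclass
from typing import List

E_VALUE_CUTOFF = 0.001  # threshold stated in the abstract


@dataclass(frozen=True)
class Hit:
    """One BLAST hit (hypothetical record, not a real BLAST parser type)."""
    protein_id: str
    taxon: str       # strain, species, or genus label, depending on the level tested
    e_value: float


def significant(hits: List[Hit]) -> List[Hit]:
    """Hits with an e-value better than the cutoff, best (lowest e-value) first."""
    return sorted((h for h in hits if h.e_value < E_VALUE_CUTOFF),
                  key=lambda h: h.e_value)


def classify(query_id: str, query_taxon: str,
             forward_hits: List[Hit], reverse_hits: List[Hit]) -> str:
    """Apply the preliminary rule to one query protein.

    forward_hits: hits from BLAST of the query against the full database.
    reverse_hits: hits from BLAST using the best outside-taxon hit as the
                  query (assumed here to exclude that protein's self-hit).
    """
    outside = [h for h in significant(forward_hits) if h.taxon != query_taxon]
    if not outside:
        return "unique"              # nothing significant outside the taxon
    reverse_ids = [h.protein_id for h in significant(reverse_hits)]
    if query_id not in reverse_ids:
        return "unique"              # reverse BLAST does not retrieve the query
    if reverse_ids[0] != query_id:
        return "manual inspection"   # retrieved, but not as the best hit
    return "not unique"
```

In this sketch the taxon label is whatever level is being tested, so the same function would be called once per level (strain, species, genus) with hits labeled accordingly.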

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The Non-Coding RNA Ontology : a comprehensive resource for the unification of non-coding RNA biology

Author: Ruttenberg Alan
Blake Judith A.
Dou Dejing
Huang Jingshan
Huan Jun
Eilbeck Karen
Natale Darren A.
Smith Barry
Huang Weili
Zimmermann Michael T.
Publication date: 01/01/2016

In recent years, sequencing technologies have enabled the identification of a wide range of non-coding RNAs (ncRNAs). Unfortunately, annotation and integration of ncRNA data have lagged behind their identification. Given the large quantity of information being obtained in this area, there emerges an urgent need to integrate what is being discovered by a broad range of relevant communities. To this end, the Non-Coding RNA Ontology (NCRO) is being developed to provide a systematically structured and precisely defined controlled vocabulary for the domain of ncRNAs, thereby facilitating the discovery, curation, analysis, exchange, and reasoning of data about structures of ncRNAs, their molecular and cellular functions, and their impacts upon phenotypes. The goal of NCRO is to serve as a common resource for annotations of diverse research in a way that will significantly enhance integrative and comparative analysis of the myriad resources currently housed in disparate sources. It is our belief that the NCRO ontology can perform an important role in the comprehensive unification of ncRNA biology and, indeed, fill a critical gap in both the Open Biological and Biomedical Ontologies (OBO) Library and the National Center for Biomedical Ontology (NCBO) BioPortal. Our initial focus is on the ontological representation of small regulatory ncRNAs, which we see as the first step in providing a resource for the annotation of data about all forms of ncRNAs. The NCRO ontology is free and open to all users

PhilPapers

Community annotation in biology

Author: Julio Jessica Anne Ecalnir
Mazumder Raja
Natale Darren A
Wu Cathy H
Yeh Lai-Su
Publication venue: BioMed Central
Publication date: 01/01/2010

Attempts to engage the scientific community to annotate biological data (such as protein/gene function) stored in databases have not been overly successful. There are several hypotheses on why this has not been successful but it is not clear which of these hypotheses are correct. In this study we have surveyed 50 biologists (who have recently published a paper characterizing a gene or protein) to better understand what would make them interested in providing input/contributions to biological databases. Based on our survey two things become clear: a) database managers need to proactively contact biologists to solicit contributions; and b) potential contributors need to be provided with an easy-to-use interface and clear instructions on what to annotate. Other factors such as 'reward' and 'employer/funding agency recognition' previously perceived as motivators were found to be less important. Based on this study we propose community annotation projects should devote resources to direct solicitation for input and streamlining of the processes or interfaces used to collect this input. Reviewers: This article was reviewed by I. King Jordan, Daniel Haft and Yuriy Gusev

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Protein Ontology: A controlled structured network of protein entities

Author: Arighi Cecilia N.
Blake Judith A.
Bult Carol J.
Christie Karen R.
Diehl Alexander D.
Drabkin Harold J.
Cowart Julie
Natale Darren A.
Helfer Olivia
Others
D’Eustachio Peter
Smith Barry
Publication date: 01/01/2013

The Protein Ontology (PRO; http://proconsortium.org) formally defines protein entities and explicitly represents their major forms and interrelations. Protein entities represented in PRO corresponding to single amino acid chains are categorized by level of specificity into family, gene, sequence and modification metaclasses, and there is a separate metaclass for protein complexes. All metaclasses also have organism-specific derivatives. PRO complements established sequence databases such as UniProtKB, and interoperates with other biomedical and biological ontologies such as the Gene Ontology (GO). PRO relates to UniProtKB in that PRO’s organism-specific classes of proteins encoded by a specific gene correspond to entities documented in UniProtKB entries. PRO relates to the GO in that PRO’s representations of organism-specific protein complexes are subclasses of the organism-agnostic protein complex terms in the GO Cellular Component Ontology. The past few years have seen growth and changes to the PRO, as well as new points of access to the data and new applications of PRO in immunology and proteomics. Here we describe some of these developments
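
The metaclass scheme described in this abstract can be pictured with a small Python sketch. It is an illustration under stated assumptions, not PRO's actual data model or API: the `Metaclass` enum, the `ProteinEntity` record, and `is_more_specific` are hypothetical names, and the ordering from family to modification simply follows the order in which the abstract lists the levels of specificity.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Metaclass(Enum):
    FAMILY = "family"
    GENE = "gene"
    SEQUENCE = "sequence"
    MODIFICATION = "modification"
    COMPLEX = "complex"   # separate metaclass for protein complexes


# Single-chain metaclasses, least to most specific (order as listed in the abstract).
SPECIFICITY_ORDER = [
    Metaclass.FAMILY,
    Metaclass.GENE,
    Metaclass.SEQUENCE,
    Metaclass.MODIFICATION,
]


@dataclass(frozen=True)
class ProteinEntity:
    """Hypothetical record for one ontology class."""
    label: str
    metaclass: Metaclass
    organism: Optional[str] = None   # set only for organism-specific derivatives

    @property
    def is_organism_specific(self) -> bool:
        return self.organism is not None


def is_more_specific(a: ProteinEntity, b: ProteinEntity) -> bool:
    """True if a's single-chain metaclass is more specific than b's."""
    if Metaclass.COMPLEX in (a.metaclass, b.metaclass):
        raise ValueError("complexes form a separate metaclass and are not ranked here")
    return SPECIFICITY_ORDER.index(a.metaclass) > SPECIFICITY_ORDER.index(b.metaclass)
```

Keeping the organism as an optional field on every record mirrors the statement that all metaclasses have organism-specific derivatives, without adding a second class hierarchy.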

PhilPapers

The development of non-coding RNA ontology

Author: Blake Judith
de Silva Nisansa
Dou Dejing
Eilbeck Karen
Huan Jun
Huang Jingshan
Huang Weili
Jiang Guoqian
Kasukurthi Mohan Vamsi
Lin Yu
Natale Darren
Ruttenberg Alan
Smith Barry
Strachan Harrison
Wu Bin
Zimmermann Michael
Publication date: 01/01/2016

Identification of non-coding RNAs (ncRNAs) has been significantly improved over the past decade. On the other hand, semantic annotation of ncRNA data is facing critical challenges due to the lack of a comprehensive ontology to serve as common data elements and data exchange standards in the field. We developed the Non-Coding RNA Ontology (NCRO) to handle this situation. By providing a formally defined ncRNA controlled vocabulary, the NCRO aims to fill a specific and highly needed niche in semantic annotation of large amounts of ncRNA biological and clinical data

PhilPapers